Tangerine: a large vocabulary Mandarin dictation system

نویسندگان

  • Yuqing Gao
  • Hsiao-Wuen Hon
  • Zhiwei Lin
  • Gareth Loudon
  • S. Yogananthan
  • Baosheng Yuan
چکیده

The text input for non-alphabetic languages, such as Chinese, has been a decades-long problem. Chinese Dictation using large vocabulary speech recognition provides a convenient mode of text entry. In contrast to a character based Dictation system [5], a word-based Mandarin dictation system has been designed [3] (based on Apple's PlainTalk speech recognition technology [4]) for efficient entry of Chinese characters into a computer. In this paper new features and improvements to the dictation system are presented. The new features and improvements have produced an overall reduction in recognition error of 50 80%. The vocabulary has also been increased from 5000 words to over 11,000 words. The new features are:mel frequency Cepstral analysis, spectral noise subtraction, cepstral mean normalisation, HMM based tone classification, training data reduction, adaptive training, more detailed sub-syllable modeling and a statistical language model.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accurac...

متن کامل

Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition

This paper presents a new approach to tone modeling for continuous Mandarin speech recognition. Mandarin tones provide rich information for speech recognition. In this paper, we treat the tone as an attribute of the final vowel part of a Mandarin syllable. Separate distributions are estimated for cepstral coefficients and pitch features respectively, and the phonetic state tied-mixture techniqu...

متن کامل

Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary

AhtractThis paper describes the first successfully implemented real-time Mandarin dictation machine developed in the world which recognizes Mandarin speech with very large vocabulary and almost unlimited texts for the input of Chinese characters into computers. Considering the special characteristics of the Chinese language, syllables are chosen as the basic units for dictation. The machine is ...

متن کامل

A multi-pass error detection and correction framework for Mandarin LVCSR

We previously proposed a multi-pass framework for Large Vocabulary Continuous Speech Recognition (LVCSR). The objective of this framework is to apply sophisticated linguistic models for recognition, while maintaining a balance between complexity and efficiency. The framework is composed of three passes: initial recognition, error detection and error correction. This paper presents and evaluates...

متن کامل

A Survey on Automatic Speech Recognition with an Illustrative Example on Continuous Speech Recognition of Mandarin

For the past two decades, research in speech recognition has been intensively carried out worldwide, spurred on by advances in signal processing, algorithms, architectures, and hardware. Speech recognition systems have been developed for a wide variety of applications, ranging from small vocabulary keyword recognition over dial-up telephone lines, to medium size vocabulary voice interactive com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995